Number of samples optimized inside PPO together.